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DETAILED ACTION 
Response to Amendment 

1 . In response to the Office Action mailed September 1 , 2004, applicant submitted 
an amendment filed on December 1, 2004, in which the applicant traversed and 
requested reconsideration with respect to claims 1, 5, 8 and 14. 

Response to Arguments 

2. Applicant(s) argue that the Digalakis reference does not teach concatenating 
lattices. Instead applicant(s) argue that the Digalakis reference is tying together HMM's. 
Digalakis teaches that each node in the tree that is generated during the clustering 
procedure corresponds to a set of states, with the leaves of the tree corresponding to 
single HMM states (column 12, lines 56-59). These different states of the lattices tied 
together are concatenation. Therefore, applicant's arguments are not persuasive. 

Applicant(s) also argue that Digalakis does not teach applying at least one 
language model to the single concatenated lattice in order to determine a relationship 
between the lattices. Applicants traverse this comparison and submit that Digalakis 
merely teaches creating output word lattices from the speaker-independent recognizer 
with a bigram language model. However, Digalakis teaches that the relative 
contribution of the two stages (relationship) is evaluated (column 13, lines 8-50). 
Therefore, applicant's arguments are not persuasive. 

Applicant(s) argue that the Reynar reference does not teach applying at least 
one language model to the single concatenated lattice in order to determine a 
relationship between the lattices. However, Reynar teaches that a stochastic model 
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(node also stores probability assigned to a certain word or phrase) is used to obtain n- 
best results (column 8, lines 1-17 with lines 32-54). Therefore applicant's arguments 
are not persuasive. 

Applicant(s) argue regarding the dependent claims 2-7, 9-13 and 15-16 that there 
is no motivation or suggestion, either in the references themselves, or in the knowledge 
generally available to one of ordinary skill in the art, to combine the references. 
However, the examiner disagrees. The examiner recognizes that obviousness can only 
be established by combining or modifying the teachings of the prior art to produce the 
claimed invention where there is some teaching, suggestion, or motivation to do so 
found either in the references themselves or in the knowledge generally available to one 
of ordinary skill in the art. See In re Fine, 837 F.2d 1071 , 5 USPQ2d 1596 (Fed. Cir. 
1988) and In re Jones, 958 F.2d 347, 21 USPQ2d 1941 (Fed. Cir. 1992). In this case, 
the motivation to do so is found in the reference, in which a column and line number is 
cited. 

Applicant(s) argue regarding claim 5 that the Mohri reference is assigned to 
AT&T Corp. and the present application is assigned to AT&T Corp. Applicant's 
arguments, see pages 11-12, filed December 1, 2004, with respect to 5 have been fully 
considered and are persuasive. The rejection of claim 5 has been withdrawn. 
However, upon further consideration, a new ground(s) of rejection is made in view of 
Morin et al., Flanagan et al. and L'Esperance et al. 
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Claim Rejections - 35 USC § 102 

3. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

4. Claims 1, 8 and 12-14 are rejected under 35 U.S.C. 102(b) as being anticipated 
by Digalakis et al. (U.S. Patent No. 5,864,810), hereinafter referenced as Digalakis. 

Regarding claims 1, 8 and 14, Digalakis discloses a method, process and 
controller of rescoring the results of automatic speech recognition (ASR), hereinafter 
referenced as ASR: method, comprising: 

generating lattices for speech utterances (column 1 1 , lines 40-44); 

concatenating the lattices (clustering engine; 4, element 12) into a single 
concatenated lattice (create a set of tied models; column 6, lines 45-53 with column 12, 
lines 56-59); and 

applying at least one language model (language model) to the single 
concatenated lattice in order to determine relationships between the lattices (column 
13, lines 8-50). . 

Regarding claim 12, Digalakis discloses the ASR method wherein the speech 
recognition model is a hidden Markov model (column 4, line 60 - column 5, line 20). 
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Regarding claim 13, Digalakis discloses the ASR method wherein the controller 
is a network server (figures 4 and 5). 

5. Alternately claims 1, 8 and 12-14 are rejected under 35 U.S.C. 102(b) as being 
anticipated by Asano et al. (USPN 5,848,389), hereinafter referenced as Asano. 

Regarding claims 1, 8 and 14, Asano discloses a method, process and 
controller of rescoring the results of automatic speech recognition (ASR), hereinafter 
referenced as ASR method, comprising: 

generating lattices (lattices) for speech utterances (speech recognition result in 
form of lattice; column 5, lines 21-37); 

concatenating the lattices into a single concatenated lattice (lattices are 
combined; column 7, lines 1-12); and 

applying at least one language model (language model) to the single 
concatenated lattice in order to determine relationships (occurrence probability where a 
certain word may be connected to another certain word) between the lattices (column 
11, lines 7-22). 

Regarding claim 12, Asano discloses the ASR method wherein the speech 
recognition model is a hidden Markov model (column 6, lines 15-37). 

Regarding claim 13, Asano discloses the ASR method wherein the controller is a 
network server (finite state network; column 1 1 , lines 58-60). 
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6. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351(a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 

7. Alternately claims 1, 8 and 13-14 are rejected under 35 U.S.C. 102(e) as being 
anticipated by Reynar.et al. (U.S. Patent No. 6,581,033), hereinafter referenced as 
Reynar. 

Regarding claims 1, 8 and 14, Reynar discloses a method, process and 
controller of rescoring the results of automatic speech recognition (ASR), hereinafter 
referenced as ASR method, comprising: 

generating lattices for speech utterances (column 8, lines 43-46); 

concatenating the lattices (concatenation process) into a single concatenated 
lattice (pieces combined into a larger lattice; column 8, lines 32-54); and 

applying at least one language model (language model) to the single 
concatenated lattice in order to determine relationships between the lattices (column 8, 
lines 1-17 with lines 33-54). 

Regarding claim 13, Reynar discloses the ASR method wherein the controller is 
a network server (figure 1). 
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Claim Rejections - 35 USC § 103 

8. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

9. Claims 2, 6-7, 9, 11 and 16 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Reynar in view of Thrasher et al. (U.S. Publication No. 
2002/0052742), hereinafter referenced as Thrasher. 

Regarding claim 2, Reynar discloses the ASR method of rescoring the results of 
automatic speech recognition, but lacks further comprising generating a confidence 
score. 

Thrasher discloses the ASR method comprising: 

generating a confidence score (confidence measure; column 3, paragraphs 0035 
and 0036) after applying the at least one speech recognition model (language model; 
figure 2, element 110), to determined whether the generated lattices are acceptable 
(identify improperly identified, column 3, paragraphs 0035 and 0036). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Reynar's method such that it generates a 
confidence score, to identify which patterns are most likely to have been improperly 
identified by the recognizer (column 3, paragraph 0035). 
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Regarding claims 6, 9 and 16, Reynar discloses the ASR method, but lacks 
wherein the rescoring the automatic speech recognition is used in a mobile 
communications system. 

Thrasher discloses the ASR method wherein the rescoring the automatic speech 
recognition is used in a mobile communications system, wireless communication 
(column 2, paragraph 0024), to relay information. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Reynar's method such that the ASR is used in a 
mobile communications system, to provide the user with alternatives to the speech 
recognition output provided by the engine (column 1, paragraph 0002). 

Regarding claims 7 and 11, Reynar discloses the ARR method, but lacks 
wherein rescoring the automatic speech recognition is used in a satellite 
communications system. 

Thrasher discloses the ASR method wherein rescoring the automatic speech 
recognition is used in a satellite communications system (satellite dish; column 2, 
paragraph 0022), to relay information. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Reynar's method such that the ASR is used in a 
satellite communications system, to provide the user with alternatives to the speech 
recognition output provided by the engine (column 1 , paragraph 0002). 



Application/Control Number: 09/966,081 Page 9 

Art Unit: 2655 

10. Claims 3-4 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Reynar in view of Thrasher, as applied to claim 2, in further view of Waibel et al. (U.S. 
Patent No. 5,712,957), hereinafter referenced as Waibel. 

Regarding claim 3, Reynar in view of Thrasher, as applied to claim 2 above, 
discloses the ASR method of rescoring the results of automatic speech recognition, but 
lacks wherein the confidence score is compared to a predetermined value. 

Waibel discloses the ASR method wherein the confidence score (confidence 
score) is compared to a predetermined value (predetermined threshold value) in order 
to determine whether to perform the automatic speech recognition process again 
(repeat again; column 1, lines 56-59), to avoid incorrect recognition. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Reynar in combination with Thrasher's method 
such that the confidence score is compared to a predetermined threshold as in Waibel, 
to repair misrecognition of speech (column 1, lines 9-12). 

Regarding claim 4, Reynar in view of Thrasher, as applied to claim 2 above, 
discloses the ASR method, but lacks wherein the automatic speech recognition 
process is performed again if the confidence score is less than the predetermined 
value. 

Waibel discloses the ASR method wherein the automatic speech recognition 
process is performed again if the confidence score is less than the predetermined 
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value (until the score is above the threshold; column 1, lines 56-59), to avoid incorrect 
recognition. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Reynar in combination with Thrasher and 
Waibel's method invention wherein the automatic speech recognition process is 
performed again if the confidence score is less than the predetermined value as in 
Waibel, to repair misrecognition of speech (column 1, lines 9-12). 

1 1 . Claim 5 is rejected under 35 U.S.C. 103(a) as being unpatentable over Reynar in 
view of Morin et al. (USPN 6,41 1,927), hereinafter referenced as Morin. 

Regarding claim 5, Reynar discloses the ASR method, but lacks wherein the 
rescoring is performed after a speech recognition model has been compensated to 
reflect acoustic environmental data transducer data. 

Morin discloses the ASR method wherein the rescoring is performed after a 
speech recognition model (speech models; column 2, lines 1-10) has been 
compensated (figure 1, element 15) to reflect acoustic environmental data and 
transducer data (figure 1 with take into account the microphone and its associated 
acoustic environment; column 3, lines 28-35), for signal equalization. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Reynar' s method wherein the rescoring is 
performed after a speech recognition model has been compensated to reflect acoustic 
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environmental data and transducer data as in Morin, to obtain signal equalization for 
normalizing a time domain source signal to a target environment (column 1, lines 7-12). 

12. Alternately Claim 5 is rejected under 35 U.S.C. 103(a) as being unpatentable 
over Reynar in view of Flanagan et al. (USPN 5,737,485), hereinafter referenced as 
Flanagan. 

Regarding claim 5, Reynar discloses the ASR method, but lacks wherein the 
rescoring is performed after a speech recognition model has been compensated to 
reflect acoustic environmental data transducer data. 

Flanagan discloses the ASR method wherein the rescoring is performed after a 
speech recognition model (speech recognition) has been compensated to reflect 
acoustic environmental data (compensate for environmental variations) and transducer 
data (microphone; figure 1 with column 3, line 47 - column 4, line 4 and column 6, line 
56 - column 7, line 6), to produce high "hands-free" identification scores. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Reynar's method wherein the rescoring is 
performed after a speech recognition model has been compensated to reflect acoustic 
environmental data and transducer data as in Flanagan, to produce high "hands-free" 
identification scores, even under hostile condition of reverberation, and low SCNR's 
caused by interfering noise. 
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13. Claim 5 is rejected under 35 U.S.C. 103(a) as being unpatentable over Reynar in 
view of L'Esperance et al. (U.S. Publication No. 2002/0055844), hereinafter referenced 
as L'Esperance. 

Regarding claim 5, Reynar discloses the ASR method, but lacks wherein the 
rescoring is performed after a speech recognition model has been compensated to 
reflect acoustic environmental data transducer data. 

L'Esperance discloses the ASR method wherein the rescoring (obtains a score 
for each model) is performed after a speech recognition model (speech recognition) 
has been compensated to reflect acoustic environmental data (various acoustic 
environments) and transducer data (figure 1 with column 3, paragraphs 0037-0041 and 
column 1, paragraph 0013), to provide essentially the same level of accuracy. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Reynar' s method wherein the rescoring is 
performed after a speech recognition model has been compensated to reflect acoustic 
environmental data and transducer data as in L'Esperance, to train models to operate 
in different situations/circumstances (column 3, paragraph 0038). 
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14. Claim 10 is rejected under 35 U.S.C. 103(a) as being unpatentable over Reynar 
in view of Pan et al. (U.S. Patent No. 6,304,844), hereinafter referenced as Pan. 

Regarding claim 10, Reynar discloses the ASR method, but lacks wherein the 
speech utterances are received from a personal digital assistant (PDA). 

Pan discloses the ASR method wherein the speech utterances are received from 
a personal digital assistant (column 12, lines 47-50 and column 13, lines 1-13), to avoid 
redesign or reprogramming of the DSP. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Reynar's method wherein the speech 
utterances are received from a PDA as in Pan, thus allowing easy, quick, and 
inexpensive integration, avoiding redesign or reprogramming of the DSP. 

15. Claim 15 is rejected under 35 U.S.C. 103(a) as being unpatentable over Reynar 
in view of Waibel. 

Regarding claim 15, Reynar discloses the ASR method of rescoring the results 
of automatic speech recognition, but lacks comprising a fourth section that determines 
whether an automatic speech recognition process should be performed again. 

Waibel discloses the ASR method comprising a fourth section that determines 
whether to perform the automatic speech recognition process again (repeat again; 
column 1, lines 56-59), to avoid incorrect recognition. 



Application/Control Number: 09/966,081 



Page 14 



Art Unit: 2655 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify Reynar's method such that it determines 
whether an automatic speech recognition process should be performed again, to repair 
misrecognition of speech (column 1, lines 9-12). 



16. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Jakieda R Jackson whose telephone number is 
571 .272.761 9. The examiner can normally be reached on Monday through Friday from 
7:30 a.m. to 5:00p.m. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David Ometz can. be reached on 571 .272.7593. The fax phone number for 
the organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-21 7-91 97 (toll-free). 



Conclusion 



JRJ 

April 28, 2005 
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